Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 2500 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 495.4 KiB |
| Average record size in memory | 202.9 B |
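The average record size follows from the total memory figure and the row count. A minimal check, assuming the report uses binary units (1 KiB = 1024 bytes); the variable names are illustrative only:

```python
# Values copied from the "Dataset statistics" table above.
total_kib = 495.4   # total size in memory, in KiB
n_rows = 2500       # number of observations

# Assuming 1 KiB = 1024 bytes, the mean number of bytes per row is:
avg_record_bytes = total_kib * 1024 / n_rows
print(f"{avg_record_bytes:.1f} B")
```

The result agrees with the reported 202.9 B after rounding to one decimal place.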
Variable types
| Numeric | 12 |
|---|---|
| Categorical | 1 |
Alerts
| Alert | Type |
|---|---|
| area is highly overall correlated with perimeter and 4 other fields | High correlation |
| perimeter is highly overall correlated with area and 7 other fields | High correlation |
| major_axis_length is highly overall correlated with area and 8 other fields | High correlation |
| minor_axis_length is highly overall correlated with area and 7 other fields | High correlation |
| convex_area is highly overall correlated with area and 4 other fields | High correlation |
| equiv_diameter is highly overall correlated with area and 4 other fields | High correlation |
| eccentricity is highly overall correlated with major_axis_length and 5 other fields | High correlation |
| roundness is highly overall correlated with perimeter and 8 other fields | High correlation |
| aspect_ration is highly overall correlated with perimeter and 7 other fields | High correlation |
| compactness is highly overall correlated with perimeter and 7 other fields | High correlation |
| class is highly overall correlated with perimeter and 6 other fields | High correlation |
| solidity is highly overall correlated with roundness | High correlation |
| extent is highly overall correlated with roundness and 2 other fields | High correlation |
Reproduction
| Analysis started | 2022-12-06 23:03:58.603231 |
|---|---|
| Analysis finished | 2022-12-06 23:04:36.261762 |
| Duration | 37.66 seconds |
| Software version | pandas-profiling v3.5.0 |
| Download configuration | config.json |
area
Real number (ℝ)
| Distinct | 2424 |
|---|---|
| Distinct (%) | 97.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 80658.221 |
| Minimum | 47939 |
| Maximum | 136574 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 19.7 KiB |
Quantile statistics
| Minimum | 47939 |
|---|---|
| 5-th percentile | 60774.85 |
| Q1 | 70765 |
| median | 79076 |
| Q3 | 89757.5 |
| 95-th percentile | 104823.8 |
| Maximum | 136574 |
| Range | 88635 |
| Interquartile range (IQR) | 18992.5 |
Descriptive statistics
| Standard deviation | 13664.51 |
|---|---|
| Coefficient of variation (CV) | 0.16941249 |
| Kurtosis | 0.12899636 |
| Mean | 80658.221 |
| Median Absolute Deviation (MAD) | 9278.5 |
| Skewness | 0.49599901 |
| Sum | 2.0164555 × 10^8 |
| Variance | 1.8671884 × 10^8 |
| Monotonicity | Not monotonic |
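Several of the statistics above are derived from others in the same tables. The sketch below recomputes them from the rounded figures reported for area, so small differences in the last digits are expected; the dictionary keys are illustrative names, not the tool's.

```python
# Figures copied from the "area" tables above (rounded as reported).
n = 2500
mean, std = 80658.221, 13664.51
minimum, maximum = 47939, 136574
q1, q3 = 70765, 89757.5

derived = {
    "range": maximum - minimum,   # Maximum - Minimum
    "iqr": q3 - q1,               # Q3 - Q1
    "cv": std / mean,             # coefficient of variation
    "variance": std ** 2,         # square of the standard deviation
    "sum": mean * n,              # mean times number of observations
}
for name, value in derived.items():
    print(f"{name:>8}: {value:.8g}")
```

Each value matches the corresponding table entry to the precision shown in the report.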
| Value | Count | Frequency (%) |
| 75637 | 3 | 0.1% |
| 97268 | 3 | 0.1% |
| 68063 | 3 | 0.1% |
| 96928 | 2 | 0.1% |
| 76461 | 2 | 0.1% |
| 72953 | 2 | 0.1% |
| 74431 | 2 | 0.1% |
| 74336 | 2 | 0.1% |
| 85054 | 2 | 0.1% |
| 88634 | 2 | 0.1% |
| Other values (2414) | 2477 |
| Value | Count | Frequency (%) |
| 47939 | 1 | |
| 48098 | 1 | |
| 49171 | 1 | |
| 49273 | 1 | |
| 49673 | 1 | |
| 50475 | 1 | |
| 50670 | 1 | |
| 50731 | 1 | |
| 50822 | 1 | |
| 51555 | 1 |
| Value | Count | Frequency (%) |
| 136574 | 1 | |
| 135455 | 1 | |
| 132035 | 1 | |
| 130913 | 1 | |
| 130071 | 1 | |
| 127033 | 1 | |
| 126963 | 1 | |
| 125949 | 1 | |
| 125697 | 1 | |
| 125214 | 1 |
perimeter
Real number (ℝ)
| Distinct | 2490 |
|---|---|
| Distinct (%) | 99.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1130.279 |
| Minimum | 868.485 |
| Maximum | 1559.45 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 19.7 KiB |
Quantile statistics
| Minimum | 868.485 |
|---|---|
| 5-th percentile | 964.9201 |
| Q1 | 1048.8297 |
| median | 1123.672 |
| Q3 | 1203.3405 |
| 95-th percentile | 1320.3929 |
| Maximum | 1559.45 |
| Range | 690.965 |
| Interquartile range (IQR) | 154.51075 |
Descriptive statistics
| Standard deviation | 109.25642 |
|---|---|
| Coefficient of variation (CV) | 0.096663228 |
| Kurtosis | -0.021849642 |
| Mean | 1130.279 |
| Median Absolute Deviation (MAD) | 76.7115 |
| Skewness | 0.41453885 |
| Sum | 2825697.5 |
| Variance | 11936.965 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1206.002 | 2 | 0.1% |
| 1014.49 | 2 | 0.1% |
| 1187.56 | 2 | 0.1% |
| 1023.719 | 2 | 0.1% |
| 1192.175 | 2 | 0.1% |
| 1134.729 | 2 | 0.1% |
| 963.377 | 2 | 0.1% |
| 1217.112 | 2 | 0.1% |
| 1253.276 | 2 | 0.1% |
| 1103.068 | 2 | 0.1% |
| Other values (2480) | 2480 |
| Value | Count | Frequency (%) |
| 868.485 | 1 | |
| 871.458 | 1 | |
| 884.106 | 1 | |
| 888.242 | 1 | |
| 889.398 | 1 | |
| 895.169 | 1 | |
| 899.493 | 1 | |
| 899.532 | 1 | |
| 902.59 | 1 | |
| 903.456 | 1 |
| Value | Count | Frequency (%) |
| 1559.45 | 1 | |
| 1520.525 | 1 | |
| 1492.183 | 1 | |
| 1491.946 | 1 | |
| 1490.954 | 1 | |
| 1476.738 | 1 | |
| 1468.224 | 1 | |
| 1465.654 | 1 | |
| 1454.583 | 1 | |
| 1453.922 | 1 |
major_axis_length
Real number (ℝ)
| Distinct | 2499 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 456.60184 |
| Minimum | 320.8446 |
| Maximum | 661.9113 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 19.7 KiB |
Quantile statistics
| Minimum | 320.8446 |
|---|---|
| 5-th percentile | 376.25062 |
| Q1 | 414.95785 |
| median | 449.4966 |
| Q3 | 492.73765 |
| 95-th percentile | 556.34869 |
| Maximum | 661.9113 |
| Range | 341.0667 |
| Interquartile range (IQR) | 77.7798 |
Descriptive statistics
| Standard deviation | 56.235704 |
|---|---|
| Coefficient of variation (CV) | 0.12316136 |
| Kurtosis | -0.015689806 |
| Mean | 456.60184 |
| Median Absolute Deviation (MAD) | 38.3324 |
| Skewness | 0.50297956 |
| Sum | 1141504.6 |
| Variance | 3162.4544 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 465.7347 | 2 | 0.1% |
| 539.6806 | 1 | < 0.1% |
| 584.4799 | 1 | < 0.1% |
| 514.3802 | 1 | < 0.1% |
| 424.5284 | 1 | < 0.1% |
| 561.5072 | 1 | < 0.1% |
| 473.3268 | 1 | < 0.1% |
| 607.8398 | 1 | < 0.1% |
| 441.2244 | 1 | < 0.1% |
| 622.8818 | 1 | < 0.1% |
| Other values (2489) | 2489 |
| Value | Count | Frequency (%) |
| 320.8446 | 1 | |
| 324.0113 | 1 | |
| 326.1485 | 1 | |
| 328.2684 | 1 | |
| 329.9696 | 1 | |
| 331.6936 | 1 | |
| 334.1895 | 1 | |
| 340.6951 | 1 | |
| 342.3154 | 1 | |
| 342.3836 | 1 |
| Value | Count | Frequency (%) |
| 661.9113 | 1 | |
| 648.9984 | 1 | |
| 648.4012 | 1 | |
| 640.1907 | 1 | |
| 632.2535 | 1 | |
| 632.108 | 1 | |
| 629.723 | 1 | |
| 625.3347 | 1 | |
| 623.0155 | 1 | |
| 622.8818 | 1 |
minor_axis_length
Real number (ℝ)
| Distinct | 2497 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 225.79492 |
| Minimum | 152.1718 |
| Maximum | 305.818 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 19.7 KiB |
Quantile statistics
| Minimum | 152.1718 |
|---|---|
| 5-th percentile | 188.26277 |
| Q1 | 211.24592 |
| median | 224.7031 |
| Q3 | 240.67287 |
| 95-th percentile | 266.12709 |
| Maximum | 305.818 |
| Range | 153.6462 |
| Interquartile range (IQR) | 29.42695 |
Descriptive statistics
| Standard deviation | 23.297245 |
|---|---|
| Coefficient of variation (CV) | 0.10317878 |
| Kurtosis | 0.073234814 |
| Mean | 225.79492 |
| Median Absolute Deviation (MAD) | 14.5294 |
| Skewness | 0.10430328 |
| Sum | 564487.3 |
| Variance | 542.7616 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 221.2116 | 2 | 0.1% |
| 220.6852 | 2 | 0.1% |
| 229.4863 | 2 | 0.1% |
| 220.2388 | 1 | < 0.1% |
| 227.7773 | 1 | < 0.1% |
| 200.5012 | 1 | < 0.1% |
| 226.9396 | 1 | < 0.1% |
| 203.6701 | 1 | < 0.1% |
| 232.3653 | 1 | < 0.1% |
| 197.3337 | 1 | < 0.1% |
| Other values (2487) | 2487 |
| Value | Count | Frequency (%) |
| 152.1718 | 1 | |
| 154.002 | 1 | |
| 154.5346 | 1 | |
| 154.7253 | 1 | |
| 155.4211 | 1 | |
| 156.1008 | 1 | |
| 160.6267 | 1 | |
| 162.796 | 1 | |
| 163.8458 | 1 | |
| 164.7038 | 1 |
| Value | Count | Frequency (%) |
| 305.818 | 1 | |
| 300.5777 | 1 | |
| 297.7952 | 1 | |
| 296.2779 | 1 | |
| 293.4921 | 1 | |
| 293.47 | 1 | |
| 292.9598 | 1 | |
| 292.6174 | 1 | |
| 292.53 | 1 | |
| 292.4844 | 1 |
convex_area
Real number (ℝ)
| Distinct | 2445 |
|---|---|
| Distinct (%) | 97.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 81508.084 |
| Minimum | 48366 |
| Maximum | 138384 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 19.7 KiB |
Quantile statistics
| Minimum | 48366 |
|---|---|
| 5-th percentile | 61477.9 |
| Q1 | 71512 |
| median | 79872 |
| Q3 | 90797.75 |
| 95-th percentile | 105956.45 |
| Maximum | 138384 |
| Range | 90018 |
| Interquartile range (IQR) | 19285.75 |
Descriptive statistics
| Standard deviation | 13764.093 |
|---|---|
| Coefficient of variation (CV) | 0.16886782 |
| Kurtosis | 0.12302642 |
| Mean | 81508.084 |
| Median Absolute Deviation (MAD) | 9346 |
| Skewness | 0.49401595 |
| Sum | 2.0377021 × 10^8 |
| Variance | 1.8945025 × 10^8 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 77745 | 3 | 0.1% |
| 76640 | 3 | 0.1% |
| 89255 | 2 | 0.1% |
| 54979 | 2 | 0.1% |
| 70045 | 2 | 0.1% |
| 80649 | 2 | 0.1% |
| 74727 | 2 | 0.1% |
| 87868 | 2 | 0.1% |
| 79445 | 2 | 0.1% |
| 67537 | 2 | 0.1% |
| Other values (2435) | 2478 |
| Value | Count | Frequency (%) |
| 48366 | 1 | |
| 48643 | 1 | |
| 49739 | 1 | |
| 50268 | 1 | |
| 50306 | 1 | |
| 51092 | 1 | |
| 51230 | 1 | |
| 51385 | 1 | |
| 51648 | 1 | |
| 52013 | 1 |
| Value | Count | Frequency (%) |
| 138384 | 1 | |
| 136373 | 1 | |
| 133706 | 1 | |
| 131934 | 1 | |
| 131713 | 1 | |
| 127906 | 1 | |
| 127781 | 1 | |
| 126962 | 1 | |
| 126538 | 1 | |
| 126196 | 1 |
equiv_diameter
Real number (ℝ)
| Distinct | 2424 |
|---|---|
| Distinct (%) | 97.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 319.33423 |
| Minimum | 247.0584 |
| Maximum | 417.0029 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 19.7 KiB |
Quantile statistics
| Minimum | 247.0584 |
|---|---|
| 5-th percentile | 278.17431 |
| Q1 | 300.16798 |
| median | 317.30535 |
| Q3 | 338.05737 |
| 95-th percentile | 365.32967 |
| Maximum | 417.0029 |
| Range | 169.9445 |
| Interquartile range (IQR) | 37.8894 |
Descriptive statistics
| Standard deviation | 26.89192 |
|---|---|
| Coefficient of variation (CV) | 0.084212456 |
| Kurtosis | -0.14670252 |
| Mean | 319.33423 |
| Median Absolute Deviation (MAD) | 18.6827 |
| Skewness | 0.27186759 |
| Sum | 798335.58 |
| Variance | 723.17535 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 310.3289 | 3 | 0.1% |
| 351.9168 | 3 | 0.1% |
| 294.3816 | 3 | 0.1% |
| 351.3012 | 2 | 0.1% |
| 312.0147 | 2 | 0.1% |
| 304.7731 | 2 | 0.1% |
| 307.8449 | 2 | 0.1% |
| 307.6484 | 2 | 0.1% |
| 329.0807 | 2 | 0.1% |
| 335.935 | 2 | 0.1% |
| Other values (2414) | 2477 |
| Value | Count | Frequency (%) |
| 247.0584 | 1 | |
| 247.4677 | 1 | |
| 250.2128 | 1 | |
| 250.4722 | 1 | |
| 251.4868 | 1 | |
| 253.5089 | 1 | |
| 253.9981 | 1 | |
| 254.151 | 1 | |
| 254.3788 | 1 | |
| 256.2067 | 1 |
| Value | Count | Frequency (%) |
| 417.0029 | 1 | |
| 415.2911 | 1 | |
| 410.0149 | 1 | |
| 408.269 | 1 | |
| 406.954 | 1 | |
| 402.1734 | 1 | |
| 402.0626 | 1 | |
| 400.4538 | 1 | |
| 400.053 | 1 | |
| 399.2836 | 1 |
eccentricity
Real number (ℝ)
| Distinct | 1295 |
|---|---|
| Distinct (%) | 51.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.8608794 |
| Minimum | 0.4921 |
| Maximum | 0.9481 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 19.7 KiB |
Quantile statistics
| Minimum | 0.4921 |
|---|---|
| 5-th percentile | 0.783195 |
| Q1 | 0.8317 |
| median | 0.8637 |
| Q3 | 0.897025 |
| 95-th percentile | 0.924305 |
| Maximum | 0.9481 |
| Range | 0.456 |
| Interquartile range (IQR) | 0.065325 |
Descriptive statistics
| Standard deviation | 0.045167399 |
|---|---|
| Coefficient of variation (CV) | 0.052466581 |
| Kurtosis | 1.7942093 |
| Mean | 0.8608794 |
| Median Absolute Deviation (MAD) | 0.0327 |
| Skewness | -0.74862334 |
| Sum | 2152.1985 |
| Variance | 0.0020400939 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.8915 | 7 | 0.3% |
| 0.8987 | 7 | 0.3% |
| 0.8834 | 7 | 0.3% |
| 0.8495 | 7 | 0.3% |
| 0.835 | 6 | 0.2% |
| 0.8433 | 6 | 0.2% |
| 0.8985 | 6 | 0.2% |
| 0.8504 | 6 | 0.2% |
| 0.8914 | 6 | 0.2% |
| 0.8828 | 6 | 0.2% |
| Other values (1285) | 2436 |
| Value | Count | Frequency (%) |
| 0.4921 | 1 | |
| 0.6586 | 1 | |
| 0.686 | 1 | |
| 0.688 | 1 | |
| 0.6903 | 1 | |
| 0.6915 | 1 | |
| 0.6944 | 1 | |
| 0.708 | 1 | |
| 0.7105 | 1 | |
| 0.7128 | 1 |
| Value | Count | Frequency (%) |
| 0.9481 | 1 | |
| 0.9464 | 1 | |
| 0.9457 | 1 | |
| 0.9448 | 1 | |
| 0.9443 | 1 | |
| 0.943 | 1 | |
| 0.9429 | 1 | |
| 0.9428 | 1 | |
| 0.942 | 1 | |
| 0.9415 | 1 |
solidity
Real number (ℝ)
| Distinct | 166 |
|---|---|
| Distinct (%) | 6.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.9894916 |
| Minimum | 0.9186 |
| Maximum | 0.9944 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 19.7 KiB |
Quantile statistics
| Minimum | 0.9186 |
|---|---|
| 5-th percentile | 0.9841 |
| Q1 | 0.9883 |
| median | 0.9903 |
| Q3 | 0.9915 |
| 95-th percentile | 0.9928 |
| Maximum | 0.9944 |
| Range | 0.0758 |
| Interquartile range (IQR) | 0.0032 |
Descriptive statistics
| Standard deviation | 0.0034935924 |
|---|---|
| Coefficient of variation (CV) | 0.0035306943 |
| Kurtosis | 81.121646 |
| Mean | 0.9894916 |
| Median Absolute Deviation (MAD) | 0.0015 |
| Skewness | -5.6910091 |
| Sum | 2473.729 |
| Variance | 1.2205188 × 10^-5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.9905 | 62 | 2.5% |
| 0.9912 | 61 | 2.4% |
| 0.9916 | 57 | 2.3% |
| 0.9911 | 56 | 2.2% |
| 0.9906 | 55 | 2.2% |
| 0.9918 | 54 | 2.2% |
| 0.9904 | 54 | 2.2% |
| 0.9909 | 52 | 2.1% |
| 0.9913 | 51 | 2.0% |
| 0.9914 | 50 | 2.0% |
| Other values (156) | 1948 |
| Value | Count | Frequency (%) |
| 0.9186 | 1 | |
| 0.9542 | 1 | |
| 0.9567 | 1 | |
| 0.9582 | 1 | |
| 0.9639 | 1 | |
| 0.9661 | 1 | |
| 0.9699 | 1 | |
| 0.9702 | 1 | |
| 0.972 | 1 | |
| 0.9728 | 1 |
| Value | Count | Frequency (%) |
| 0.9944 | 1 | < 0.1% |
| 0.9943 | 1 | < 0.1% |
| 0.9939 | 2 | 0.1% |
| 0.9938 | 6 | 0.2% |
| 0.9937 | 8 | 0.3% |
| 0.9936 | 12 | |
| 0.9935 | 8 | 0.3% |
| 0.9934 | 12 | |
| 0.9933 | 6 | 0.2% |
| 0.9932 | 22 |
extent
Real number (ℝ)
| Distinct | 1392 |
|---|---|
| Distinct (%) | 55.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.69320452 |
| Minimum | 0.468 |
| Maximum | 0.8296 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 19.7 KiB |
Quantile statistics
| Minimum | 0.468 |
|---|---|
| 5-th percentile | 0.56768 |
| Q1 | 0.6589 |
| median | 0.71305 |
| Q3 | 0.740225 |
| 95-th percentile | 0.7623 |
| Maximum | 0.8296 |
| Range | 0.3616 |
| Interquartile range (IQR) | 0.081325 |
Descriptive statistics
| Standard deviation | 0.060913648 |
|---|---|
| Coefficient of variation (CV) | 0.087872549 |
| Kurtosis | 0.42498155 |
| Mean | 0.69320452 |
| Median Absolute Deviation (MAD) | 0.03345 |
| Skewness | -1.0265683 |
| Sum | 1733.0113 |
| Variance | 0.0037104725 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.7249 | 9 | 0.4% |
| 0.7445 | 8 | 0.3% |
| 0.7403 | 8 | 0.3% |
| 0.7435 | 7 | 0.3% |
| 0.7379 | 7 | 0.3% |
| 0.7201 | 7 | 0.3% |
| 0.7325 | 7 | 0.3% |
| 0.7424 | 7 | 0.3% |
| 0.7393 | 7 | 0.3% |
| 0.7189 | 6 | 0.2% |
| Other values (1382) | 2427 |
| Value | Count | Frequency (%) |
| 0.468 | 1 | |
| 0.4695 | 1 | |
| 0.4822 | 1 | |
| 0.4843 | 1 | |
| 0.4888 | 1 | |
| 0.495 | 1 | |
| 0.497 | 1 | |
| 0.4977 | 1 | |
| 0.5 | 1 | |
| 0.5005 | 1 |
| Value | Count | Frequency (%) |
| 0.8296 | 1 | |
| 0.7993 | 1 | |
| 0.7954 | 1 | |
| 0.7879 | 1 | |
| 0.7831 | 1 | |
| 0.7824 | 1 | |
| 0.7814 | 1 | |
| 0.781 | 1 | |
| 0.7808 | 1 | |
| 0.7801 | 1 |
roundness
Real number (ℝ)
| Distinct | 1480 |
|---|---|
| Distinct (%) | 59.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.79153276 |
| Minimum | 0.5546 |
| Maximum | 0.9396 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 19.7 KiB |
Quantile statistics
| Minimum | 0.5546 |
|---|---|
| 5-th percentile | 0.6922 |
| Q1 | 0.7519 |
| median | 0.79775 |
| Q3 | 0.834325 |
| 95-th percentile | 0.873715 |
| Maximum | 0.9396 |
| Range | 0.385 |
| Interquartile range (IQR) | 0.082425 |
Descriptive statistics
| Standard deviation | 0.055923947 |
|---|---|
| Coefficient of variation (CV) | 0.070652725 |
| Kurtosis | -0.239235 |
| Mean | 0.79153276 |
| Median Absolute Deviation (MAD) | 0.04015 |
| Skewness | -0.37268712 |
| Sum | 1978.8319 |
| Variance | 0.0031274878 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.7609 | 7 | 0.3% |
| 0.806 | 6 | 0.2% |
| 0.8267 | 6 | 0.2% |
| 0.7749 | 6 | 0.2% |
| 0.7933 | 6 | 0.2% |
| 0.8357 | 6 | 0.2% |
| 0.8028 | 6 | 0.2% |
| 0.835 | 6 | 0.2% |
| 0.7413 | 5 | 0.2% |
| 0.781 | 5 | 0.2% |
| Other values (1470) | 2441 |
| Value | Count | Frequency (%) |
| 0.5546 | 1 | |
| 0.5825 | 1 | |
| 0.6153 | 1 | |
| 0.6226 | 1 | |
| 0.627 | 1 | |
| 0.6327 | 1 | |
| 0.6338 | 1 | |
| 0.6374 | 1 | |
| 0.6391 | 1 | |
| 0.6426 | 1 |
| Value | Count | Frequency (%) |
| 0.9396 | 1 | |
| 0.9255 | 1 | |
| 0.9233 | 1 | |
| 0.9221 | 1 | |
| 0.9214 | 1 | |
| 0.9193 | 1 | |
| 0.9162 | 1 | |
| 0.9161 | 1 | |
| 0.916 | 1 | |
| 0.9156 | 1 |
aspect_ration
Real number (ℝ)
| Distinct | 2237 |
|---|---|
| Distinct (%) | 89.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.0417023 |
| Minimum | 1.1487 |
| Maximum | 3.1444 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 19.7 KiB |
Quantile statistics
| Minimum | 1.1487 |
|---|---|
| 5-th percentile | 1.60829 |
| Q1 | 1.80105 |
| median | 1.9842 |
| Q3 | 2.262075 |
| 95-th percentile | 2.620525 |
| Maximum | 3.1444 |
| Range | 1.9957 |
| Interquartile range (IQR) | 0.461025 |
Descriptive statistics
| Standard deviation | 0.31599688 |
|---|---|
| Coefficient of variation (CV) | 0.15477128 |
| Kurtosis | -0.20336105 |
| Mean | 2.0417023 |
| Median Absolute Deviation (MAD) | 0.2177 |
| Skewness | 0.54823109 |
| Sum | 5104.2558 |
| Variance | 0.099854031 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.8606 | 4 | 0.2% |
| 1.8491 | 4 | 0.2% |
| 1.7648 | 3 | 0.1% |
| 1.7601 | 3 | 0.1% |
| 1.8275 | 3 | 0.1% |
| 2.6348 | 3 | 0.1% |
| 1.9006 | 3 | 0.1% |
| 1.8176 | 3 | 0.1% |
| 2.2067 | 3 | 0.1% |
| 1.806 | 3 | 0.1% |
| Other values (2227) | 2468 |
| Value | Count | Frequency (%) |
| 1.1487 | 1 | |
| 1.329 | 1 | |
| 1.3744 | 1 | |
| 1.378 | 1 | |
| 1.3822 | 1 | |
| 1.3843 | 1 | |
| 1.3897 | 1 | |
| 1.4161 | 1 | |
| 1.421 | 1 | |
| 1.4259 | 1 |
| Value | Count | Frequency (%) |
| 3.1444 | 1 | |
| 3.0969 | 1 | |
| 3.0759 | 1 | |
| 3.051 | 1 | |
| 3.0374 | 1 | |
| 3.0041 | 1 | |
| 3.0017 | 1 | |
| 2.9988 | 1 | |
| 2.9789 | 1 | |
| 2.9665 | 1 |
compactness
Real number (ℝ)
| Distinct | 1405 |
|---|---|
| Distinct (%) | 56.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.70412052 |
| Minimum | 0.5608 |
| Maximum | 0.9049 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 19.7 KiB |
Quantile statistics
| Minimum | 0.5608 |
|---|---|
| 5-th percentile | 0.6158 |
| Q1 | 0.663475 |
| median | 0.7077 |
| Q3 | 0.7435 |
| 95-th percentile | 0.785605 |
| Maximum | 0.9049 |
| Range | 0.3441 |
| Interquartile range (IQR) | 0.080025 |
Descriptive statistics
| Standard deviation | 0.053066885 |
|---|---|
| Coefficient of variation (CV) | 0.075366196 |
| Kurtosis | -0.50083343 |
| Mean | 0.70412052 |
| Median Absolute Deviation (MAD) | 0.0394 |
| Skewness | -0.062376578 |
| Sum | 1760.3013 |
| Variance | 0.0028160943 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.7073 | 7 | 0.3% |
| 0.7264 | 7 | 0.3% |
| 0.7077 | 6 | 0.2% |
| 0.7414 | 6 | 0.2% |
| 0.7093 | 6 | 0.2% |
| 0.7518 | 6 | 0.2% |
| 0.6175 | 6 | 0.2% |
| 0.7435 | 6 | 0.2% |
| 0.6851 | 6 | 0.2% |
| 0.7356 | 5 | 0.2% |
| Other values (1395) | 2439 |
| Value | Count | Frequency (%) |
| 0.5608 | 1 | |
| 0.567 | 1 | |
| 0.5673 | 1 | |
| 0.5687 | 1 | |
| 0.5698 | 1 | |
| 0.5704 | 1 | |
| 0.5732 | 1 | |
| 0.5753 | 1 | |
| 0.5768 | 1 | |
| 0.5785 | 1 |
| Value | Count | Frequency (%) |
| 0.9049 | 1 | |
| 0.8665 | 1 | |
| 0.852 | 1 | |
| 0.8491 | 1 | |
| 0.8481 | 1 | |
| 0.8474 | 1 | |
| 0.8468 | 1 | |
| 0.8377 | 1 | |
| 0.8374 | 1 | |
| 0.8359 | 1 |
class
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 225.8 KiB |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 11.44 |
| Min length | 10 |
Characters and Unicode
| Total characters | 28600 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
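The length and character figures for class can be reproduced from the two labels and their counts in this report (1300 and 1200 rows). This is a sketch under the assumption that characters are counted as Unicode code points of NFC-normalised strings; the report does not state how it counts them.

```python
import unicodedata
from collections import Counter

# Class labels and row counts as listed for the class variable in this report.
counts = {"Çerçevelik": 1300, "Ürgüp Sivrisi": 1200}
# Normalise to NFC so each accented letter is one code point (an assumption).
counts = {unicodedata.normalize("NFC", k): v for k, v in counts.items()}

n_rows = sum(counts.values())
total_chars = sum(len(label) * c for label, c in counts.items())
mean_length = total_chars / n_rows

# Weight every character of a label by the number of rows carrying that label.
char_counts = Counter()
for label, c in counts.items():
    for ch in label:
        char_counts[ch] += c

print(total_chars, mean_length, len(char_counts))
print(char_counts.most_common(3))
```

Under that assumption the totals agree with the report: 28600 characters, a mean length of 11.44, and 15 distinct characters.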
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Çerçevelik |
|---|---|
| 2nd row | Çerçevelik |
| 3rd row | Çerçevelik |
| 4th row | Çerçevelik |
| 5th row | Çerçevelik |
Common Values
| Value | Count | Frequency (%) |
| Çerçevelik | 1300 | |
| Ürgüp Sivrisi | 1200 |
Common Values (Plot)
| Value | Count | Frequency (%) |
| çerçevelik | 1300 | |
| ürgüp | 1200 | |
| sivrisi | 1200 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 4900 | |
| e | 3900 | |
| r | 3700 | |
| v | 2500 | |
| Ç | 1300 | 4.5% |
| ç | 1300 | 4.5% |
| l | 1300 | 4.5% |
| k | 1300 | 4.5% |
| Ü | 1200 | 4.2% |
| g | 1200 | 4.2% |
| Other values (5) | 6000 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 23700 | |
| Uppercase Letter | 3700 | 12.9% |
| Space Separator | 1200 | 4.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 4900 | |
| e | 3900 | |
| r | 3700 | |
| v | 2500 | |
| ç | 1300 | 5.5% |
| l | 1300 | 5.5% |
| k | 1300 | 5.5% |
| g | 1200 | 5.1% |
| ü | 1200 | 5.1% |
| p | 1200 | 5.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| Ç | 1300 | |
| Ü | 1200 | |
| S | 1200 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1200 | |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 27400 | |
| Common | 1200 | 4.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 4900 | |
| e | 3900 | |
| r | 3700 | |
| v | 2500 | |
| Ç | 1300 | 4.7% |
| ç | 1300 | 4.7% |
| l | 1300 | 4.7% |
| k | 1300 | 4.7% |
| Ü | 1200 | 4.4% |
| g | 1200 | 4.4% |
| Other values (4) | 4800 |
Common
| Value | Count | Frequency (%) |
| (space) | 1200 | |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23600 | |
| None | 5000 | 17.5% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 4900 | |
| e | 3900 | |
| r | 3700 | |
| v | 2500 | |
| l | 1300 | 5.5% |
| k | 1300 | 5.5% |
| g | 1200 | 5.1% |
| p | 1200 | 5.1% |
| (space) | 1200 | 5.1% |
| S | 1200 | 5.1% |
None
| Value | Count | Frequency (%) |
| Ç | 1300 | |
| ç | 1300 | |
| Ü | 1200 | |
| ü | 1200 |
Auto
The auto setting computes an interpretable pairwise metric for every pair of columns, chosen by variable type according to the following mapping (Variable_type-Variable_type : Method, Range):
- Categorical-Categorical : Cramer's V, [0,1]
- Numerical-Categorical : Cramer's V, [0,1] (using a discretized numerical column)
- Numerical-Numerical : Spearman's ρ, [-1,1]
This configuration uses the recommended metric for each pair of columns.
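The Numerical-Categorical case can be sketched as follows. This is a minimal illustration, not the tool's implementation: it uses the standard uncorrected Cramér's V, V = sqrt(χ² / (n · (min(r, c) − 1))), and a hypothetical equal-width `discretize` helper, since the report does not state the binning or any bias correction it applies. The ten values are aspect_ration and class from rows 0 to 4 and 2490 to 2494 of the sample tables in this report, far too few for a meaningful estimate.

```python
from collections import Counter
from math import sqrt

def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long label sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row, col = Counter(xs), Counter(ys)
    chi2 = 0.0
    for x in row:
        for y in col:
            expected = row[x] * col[y] / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row), len(col)) - 1
    return sqrt(chi2 / (n * k)) if k > 0 else 0.0

def discretize(values, bins=2):
    """Equal-width binning (hypothetical stand-in for the tool's binning)."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins or 1.0
    return [min(int((v - lo) / width), bins - 1) for v in values]

aspect = [1.4809, 1.7811, 2.0651, 1.7146, 1.7413,
          2.4397, 1.7644, 1.9267, 2.1681, 2.1594]
labels = ["Çerçevelik"] * 5 + ["Ürgüp Sivrisi"] * 5

v = cramers_v(discretize(aspect), labels)
print(round(v, 4))
```

Perfectly associated labels give V = 1 and independent labels give V = 0, which matches the [0,1] range listed in the mapping.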
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better than Pearson's r at catching nonlinear monotonic correlations. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
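The calculation described above amounts to ranking each variable and then applying the Pearson formula to the ranks. A minimal pure-Python sketch, with ties given their average rank (a common convention, assumed here), applied to area and perimeter from rows 0 to 4 of the sample table in this report; five rows are for illustration only.

```python
from math import sqrt

def average_ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def pearson(xs, ys):
    """Covariance divided by the product of the standard deviations."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

def spearman(xs, ys):
    """Pearson correlation of the rank variables."""
    return pearson(average_ranks(xs), average_ranks(ys))

area = [56276, 76631, 71623, 66458, 66107]
perimeter = [888.242, 1068.146, 1082.987, 992.051, 998.146]
rho = spearman(area, perimeter)
print(round(rho, 4))
```

Because only ranks enter the formula, any strictly increasing transformation of either variable leaves ρ unchanged.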
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
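The formula and the invariance property can be illustrated with a short sketch. It uses area and perimeter from rows 0 to 4 of the sample table in this report, and the shift and scale constants are arbitrary choices for the demonstration. The scale factors are positive; a negative factor would flip the sign of r.

```python
from math import sqrt

def pearson(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / n
    std_x = sqrt(sum((x - mean_x) ** 2 for x in xs) / n)
    std_y = sqrt(sum((y - mean_y) ** 2 for y in ys) / n)
    return cov / (std_x * std_y)

area = [56276, 76631, 71623, 66458, 66107]
perimeter = [888.242, 1068.146, 1082.987, 992.051, 998.146]

r = pearson(area, perimeter)
# Shift and rescale each variable separately (positive scale factors).
r_rescaled = pearson([a / 1000 + 5 for a in area],
                     [2 * p - 100 for p in perimeter])
print(r, r_rescaled)
```

The two printed values agree up to floating-point error, which is the invariance described above.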
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
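The pair-counting definition above corresponds to the variant usually called tau-a. The sketch below implements exactly that definition; statistical libraries often default to a tie-corrected variant (tau-b), so results can differ when the data contain ties. The inputs are area and perimeter from rows 0 to 4 of the sample table in this report, for illustration only.

```python
from itertools import combinations

def kendall_tau_a(xs, ys):
    """(concordant - discordant) / total number of pairs, without tie correction."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1    # both variables order the pair the same way
        elif sign < 0:
            discordant += 1    # the variables order the pair in opposite ways
    n_pairs = len(xs) * (len(xs) - 1) // 2
    return (concordant - discordant) / n_pairs

area = [56276, 76631, 71623, 66458, 66107]
perimeter = [888.242, 1068.146, 1082.987, 992.051, 998.146]
tau = kendall_tau_a(area, perimeter)
print(tau)
```

For these five rows, 8 of the 10 pairs are concordant and 2 are discordant, giving τ = 0.6.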
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available in the Phik project's own documentation.

| | area | perimeter | major_axis_length | minor_axis_length | convex_area | equiv_diameter | eccentricity | solidity | extent | roundness | aspect_ration | compactness | class |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 56276 | 888.242 | 326.1485 | 220.2388 | 56831 | 267.6805 | 0.7376 | 0.9902 | 0.7453 | 0.8963 | 1.4809 | 0.8207 | Çerçevelik |
| 1 | 76631 | 1068.146 | 417.1932 | 234.2289 | 77280 | 312.3614 | 0.8275 | 0.9916 | 0.7151 | 0.8440 | 1.7811 | 0.7487 | Çerçevelik |
| 2 | 71623 | 1082.987 | 435.8328 | 211.0457 | 72663 | 301.9822 | 0.8749 | 0.9857 | 0.7400 | 0.7674 | 2.0651 | 0.6929 | Çerçevelik |
| 3 | 66458 | 992.051 | 381.5638 | 222.5322 | 67118 | 290.8899 | 0.8123 | 0.9902 | 0.7396 | 0.8486 | 1.7146 | 0.7624 | Çerçevelik |
| 4 | 66107 | 998.146 | 383.8883 | 220.4545 | 67117 | 290.1207 | 0.8187 | 0.9850 | 0.6752 | 0.8338 | 1.7413 | 0.7557 | Çerçevelik |
| 5 | 73191 | 1041.460 | 405.8132 | 231.4261 | 73969 | 305.2698 | 0.8215 | 0.9895 | 0.7165 | 0.8480 | 1.7535 | 0.7522 | Çerçevelik |
| 6 | 73338 | 1020.055 | 392.2516 | 238.5494 | 73859 | 305.5762 | 0.7938 | 0.9929 | 0.7187 | 0.8857 | 1.6443 | 0.7790 | Çerçevelik |
| 7 | 69692 | 1049.108 | 421.4875 | 211.7707 | 70442 | 297.8836 | 0.8646 | 0.9894 | 0.6736 | 0.7957 | 1.9903 | 0.7067 | Çerçevelik |
| 8 | 95727 | 1231.609 | 488.1199 | 251.3086 | 96831 | 349.1180 | 0.8573 | 0.9886 | 0.6188 | 0.7930 | 1.9423 | 0.7152 | Çerçevelik |
| 9 | 73465 | 1047.767 | 413.6504 | 227.2644 | 74089 | 305.8407 | 0.8356 | 0.9916 | 0.7443 | 0.8409 | 1.8201 | 0.7394 | Çerçevelik |
| | area | perimeter | major_axis_length | minor_axis_length | convex_area | equiv_diameter | eccentricity | solidity | extent | roundness | aspect_ration | compactness | class |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2490 | 51555 | 934.911 | 401.8321 | 164.7038 | 52013 | 256.2067 | 0.9121 | 0.9912 | 0.7187 | 0.7412 | 2.4397 | 0.6376 | Ürgüp Sivrisi |
| 2491 | 69836 | 1010.605 | 396.6286 | 224.7918 | 70419 | 298.1911 | 0.8239 | 0.9917 | 0.6693 | 0.8593 | 1.7644 | 0.7518 | Ürgüp Sivrisi |
| 2492 | 84236 | 1274.656 | 456.9323 | 237.1540 | 85248 | 327.4944 | 0.8548 | 0.9881 | 0.6104 | 0.6515 | 1.9267 | 0.7167 | Ürgüp Sivrisi |
| 2493 | 58987 | 977.410 | 404.0779 | 186.3710 | 59518 | 274.0522 | 0.8873 | 0.9911 | 0.7327 | 0.7759 | 2.1681 | 0.6782 | Ürgüp Sivrisi |
| 2494 | 79755 | 1146.431 | 470.3888 | 217.8296 | 80649 | 318.6647 | 0.8863 | 0.9889 | 0.7175 | 0.7626 | 2.1594 | 0.6774 | Ürgüp Sivrisi |
| 2495 | 79637 | 1224.710 | 533.1513 | 190.4367 | 80381 | 318.4289 | 0.9340 | 0.9907 | 0.4888 | 0.6672 | 2.7996 | 0.5973 | Ürgüp Sivrisi |
| 2496 | 69647 | 1084.318 | 462.9416 | 191.8210 | 70216 | 297.7874 | 0.9101 | 0.9919 | 0.6002 | 0.7444 | 2.4134 | 0.6433 | Ürgüp Sivrisi |
| 2497 | 87994 | 1210.314 | 507.2200 | 222.1872 | 88702 | 334.7199 | 0.8990 | 0.9920 | 0.7643 | 0.7549 | 2.2828 | 0.6599 | Ürgüp Sivrisi |
| 2498 | 80011 | 1182.947 | 501.9065 | 204.7531 | 80902 | 319.1758 | 0.9130 | 0.9890 | 0.7374 | 0.7185 | 2.4513 | 0.6359 | Ürgüp Sivrisi |
| 2499 | 84934 | 1159.933 | 462.8951 | 234.5597 | 85781 | 328.8485 | 0.8621 | 0.9901 | 0.7360 | 0.7933 | 1.9735 | 0.7104 | Ürgüp Sivrisi |